Search in the Catalogues and Directories

Hits 1 – 20 of 38

1
C3: Continued Pretraining with Contrastive Weak Supervision for Cross Language Ad-Hoc Retrieval ...
BASE
2
Transfer Learning Approaches for Building Cross-Language Dense Retrieval Models ...
BASE
3
The Multilingual TEDx Corpus for Speech Recognition and Translation ...
BASE
4
An Information Retrieval Test Collection for English SMS Conversations
BASE
5
Microblogging Temporal Summarization: Filtering Important Twitter Updates for Breaking News
Xu, Tan. - 2015
Abstract: While news stories are an important traditional medium to broadcast and consume news, microblogging has recently emerged as a place where people can discuss, disseminate, collect or report information about news. However, the massive information in the microblogosphere makes it hard for readers to keep up with these real-time updates. This is especially a problem when it comes to breaking news, where people are more eager to know “what is happening”. Therefore, this dissertation is intended as an exploratory effort to investigate computational methods to augment human effort when monitoring the development of breaking news on a given topic from a microblog stream by extractively summarizing the updates in a timely manner. More specifically, given an interest in a topic, either entered as a query or presented as an initial news report, a microblog temporal summarization system is proposed to filter microblog posts from a stream with three primary concerns: topical relevance, novelty, and salience. Considering the relatively high arrival rate of microblog streams, a cascade framework consisting of three stages is proposed to progressively reduce quantity of posts. For each step in the cascade, this dissertation studies methods that improve over current baselines. In the relevance filtering stage, query and document expansion techniques are applied to mitigate sparsity and vocabulary mismatch issues. The use of word embedding as a basis for filtering is also explored, using unsupervised and supervised modeling to characterize lexical and semantic similarity. In the novelty filtering stage, several statistical ways of characterizing novelty are investigated and ensemble learning techniques are used to integrate results from these diverse techniques. These results are compared with a baseline clustering approach using both standard and delay-discounted measures.
In the salience filtering stage, because of the real-time prediction requirement a method of learning verb phrase usage from past relevant news reports is used in conjunction with some standard measures for characterizing writing quality. Following a Cranfield-like evaluation paradigm, this dissertation includes a series of experiments to evaluate the proposed methods for each step, and for the end-to-end system. New microblog novelty and salience judgments are created, building on existing relevance judgments from the TREC Microblog track. The results point to future research directions at the intersection of social media, computational journalism, information retrieval, automatic summarization, and machine learning.
Keyword: Information Filtering; Information science; Machine Learning; Microblog; Social Media; Temporal Summarization
URL: http://hdl.handle.net/1903/18139
https://doi.org/10.13016/M2Q777
BASE
6
Frontiers, Challenges, and Opportunities for Information Retrieval – Report from SWIRL 2012, The Second Strategic Workshop on Information Retrieval in Lorne
Kelly, Diane; Clarke, Charles L.A.; Moffat, Alistair. - : KTH, Teoretisk datalogi, TCS, 2012. : ACM, 2012
BASE
7
Formative Evaluation for Multilingual Multimedia Search and Sense-Making
BASE
8
The enduring spoken word : [comment on D. W. Oard: "Unlocking the potential of the spoken word", 26 September 2008, p.1787]
In: Science. - Washington, DC : AAAS, American Assoc. for the Advancement of Science 323 (2009) 5917, 1010-1011
BLLDB
9
Advances in Multilingual and Multimodal Information Retrieval : 8th Workshop of the Cross-Language Evaluation Forum, CLEF 2007, Budapest, Hungary, September 19-21, 2007, Revised Selected Papers
Jijkoun, Valentin; Mandl, Thomas; Müller, Henning. - Berlin, Heidelberg : Springer Berlin Heidelberg, 2008
UB Frankfurt Linguistik
10
Unlocking the potential of the spoken word : advances in speech processing may soon place speech and writing on a more equal footing, with broad implications for many aspects of society
In: Science. - Washington, DC : AAAS, American Assoc. for the Advancement of Science 321 (2008) 5897, 1787-1788
BLLDB
11
Combining Evidence from Unconstrained Spoken Term Frequency Estimation for Improved Speech Retrieval
BASE
12
Classifying Attitude by Topic Aspect for English and Chinese Document Collections
Wu, Yejun. - 2008
BASE
13
Overview of the CLEF-2006 cross-language speech retrieval track
In: Oard, Douglas W., Wang, Jianqiang, Jones, Gareth J.F. orcid:0000-0003-2923-8365, White, Ryen W., Pecina, Pavel, Soergel, Dagobert, Huang, Xiaoli and Shafran, Izhak (2007) Overview of the CLEF-2006 cross-language speech retrieval track. In: CLEF 2006: Workshop on Cross-Language Information Retrieval and Evaluation, 20-22 Sept. 2006, Alicante, Spain. (2007)
BASE
14
Investigating cross-language speech retrieval for a spontaneous conversational speech collection
In: Inkpen, Diana, Alzghool, Muath, Jones, Gareth J.F. orcid:0000-0003-2923-8365 and Oard, Douglas W. (2006) Investigating cross-language speech retrieval for a spontaneous conversational speech collection. In: HLT-NAACL 2006 - The Human Language Technology Conference - North American Chapter of the Association for Computational Linguistics Annual Meeting, 8-9 June 2006, New York, USA. (2006)
BASE
15
The Effect of Bilingual Term List Size on Dictionary-Based Cross-Language Information Retrieval
In: DTIC (2006)
BASE
16
TREC-9 Experiments at Maryland: Interactive CLIR
In: DTIC (2006)
BASE
17
COMPLEX QUESTION ANSWERING BASED ON A SEMANTIC DOMAIN MODEL OF CLINICAL MEDICINE
BASE
18
Comparing User-Assisted and Automatic Query Translation
In: DTIC AND NTIS (2005)
BASE
19
Spontaneous speech processing
Furui, Sadaoki (ed.); Beckman, Mary E. (ed.); Hirschberg, Julia (ed.)...
In: Institute of Electrical and Electronics Engineers. IEEE transactions on speech and audio processing. - New York, NY : Inst. 12 (2004) 4, 349-445
BLLDB
20
Interactive cross-language document selection
In: Information Retrieval Journal. - Dordrecht [et al.] : Springer Science + Business Media B.V. 7 (2004) 1-2, 205-228
BLLDB
